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Abstract — An adaptive distributed space-time coding (DSTC) 
scheme is proposed for two-hop cooperative MIMO networks. 
Linear minimum mean square error (MMSE) receive filters and 
adjustable code matrices are considered subject to a power con- 
straint with an amplify-and-forward (AF) cooperation strategy. 
In the proposed adaptive DSTC scheme, an adjustable code 
matrix obtained by a feedback channel is employed to transform 
the space-time coded matrix at the relay node. The effects 
of the limited feedback and the feedback errors are assessed. 
Linear MMSE expressions are devised to compute the parameters 
of the adjustable code matrix and the linear receive filters. 
Stochastic gradient (SG) and least-squares (LS) algorithms are 
also developed with reduced computational complexity. An upper 
bound on the pairwise error probability analysis is derived and 
indicates the advantage of employing the adjustable code matrices 
at the relay nodes. An alternative optimization algorithm for the 
adaptive DSTC scheme is also derived in order to eliminate the 
need for the feedback. The algorithm provides a fully distributed 
scheme for the adaptive DSTC at the relay node based on the 
minimization of the error probability. Simulation results show 
that the proposed algorithms obtain significant performance 
gains as compared to existing DSTC schemes. 

Index Terms — Adaptive algorithms, space-time codes with 
feedback, cooperative systems, distributed space time codes. 



I. Introduction 

Cooperative multiple-input and multiple-output (MIMO) 
systems, which employ multiple relay nodes with antennas 
between the source node and the destination node as a dis- 
tributed antenna array, can obtain diversity gains by providing 
copies of the transmitted signals to improve the reliability 
of wireless communication systems (TJ. Among the links 
between the relay nodes and the destination node, cooperation 
strategies such as Amplify-and-Forward (AF), Decode-and- 
Forward (DF), Compress-and-Forward (CF) ||2] and various 
distributed space-time coding (DSTC) schemes J5], J4), ||5) 
can be employed. 

The use of distributed space-time codes (DSTC) at the 
relay node in a cooperative network, providing more copies 
of the desired symbols at the destination node, can offer 
the system diversity and coding gains to mitigate the in- 
terference. A recent focus on DSTC techniques lies on the 
design of delay-tolerant codes and full-diversity schemes with 
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minimum outage probability. An opportunistic DSTC scheme 
with the minimum outage probability is designed for a DF 
cooperative network and compared with the fixed DSTC 
schemes in J6), while in a novel opportunistic relaying 
algorithm is achieved by employing DSTC in an AF cooper- 
ative MIMO network. An adaptive distributed-Alamouti (D- 
Alamouti) space-time block code (STBC) design is proposed 
in for non-regenerative dual-hop wireless systems which 
achieves the minimum outage probability. DSTC schemes 
for the AF protocol are discussed in 191- tim . In J9], the 
GABBA STC scheme is extended to a distributed MIMO 
network with full-diversity and full-rate, while an optimal 
algorithm for the design of the DSTC scheme to achieve 
the optimal diversity and multiplexing tradeoff is derived 
in (10). A quasi-orthogonal DSTBC for cooperative MIMO 
networks is presented and shown to achieve full rate and full 
diversity with any number of antennas in ifTTI . In lfl2l . a new 
STC scheme that multiplies a randomized matrix by the STC 
matrix at the relay node before the transmission is derived 
and analyzed. The randomized space-time code (RSTC) can 
achieve the performance of a centralized space-time code in 
terms of coding gain and diversity order. 

Optimal space-time codes can be obtained by transmitting 
the channel or other useful information for code design back 
to the source node, in order to achieve higher coding gains 
by pre-processing the symbols. In ifTJl . the trade-off between 
the length of the feedback symbols, which is related to the 
capacity loss and the transmission rate is discussed, whereas 
in lfl4ll one solution for this trade-off problem is derived. The 
use of limited feedback for STC encoding has been widely 
discussed in the literature. In lfT31 . the phase information is 
sent back for STC encoding in order to maintain the full 
diversity, and the phase feedback is employed in lfl6l to 
improve the performance of the Alamouti STBC. A limited 
feedback link is used in IfTTI and lfl8ll to provide the channel 
information for the pre-coding of an orthogonal STBC scheme. 
Another limited feedback strategy has been considered for 
power relay selection in lfl9l . 

In this paper, we propose an adaptive distributed space-time 
coding scheme and algorithms for cooperative MIMO relaying 
systems. This work was first introduced and discussed in ll3D . 
We first develop a centralized algorithm with limited feedback 
to compute the parameters of an adjustable code matrix, which 
requires sending the adjustable code matrices back to the relay 
nodes after the optimization via a feedback channel that is 
modeled as a Binary Symmetric Channel (BSC). Then, adap- 



tive optimization algorithms are derived based on the MSE and 
the ML criteria subject to constraints on the transmitted power 
at the relays, in order to release the destination node from the 
high computational complexity of the optimization process. 
We focus on how the adjustable code matrix affects the DSTC 
during the encoding and how to optimize the linear receive 
filter with the code matrix iteratively or, alternatively, by 
employing an ML detector and adjusting the code matrix. The 
upper bound of the error probability of the proposed adaptive 
DSTC is derived in order to show its advantages as compared 
to the traditional DSTC schemes and the influence of the 
imperfect feedback is discussed. It is shown that the use of an 
adjustable code matrix benefits the performance of the system 
compared to employing traditional STC schemes. Then, we 
derive a fully distributed matrix optimization algorithm which 
does not require feedback. The pairwise error probability 
(PEP) of the adaptive DSTC is employed in order to devise a 
distributed algorithm and to eliminate the need for feedback 
channels. The fully distributed matrix optimization algorithm 
allows the system to use the optimal adjustable matrix before 
the transmission, and also achieves the minimum PEP when 
the statistical information of the channel does not change. The 
differences of our work compared with the existing works are 
discussed as follows. First, an optimal adjustable code matrix 
will be multiplied by an existing space-time coding scheme 
at the relay node and the encoded data are forwarded to the 
destination node. The code matrix is first generated randomly 
as discussed in |p"2| . and it is optimized according to different 
criteria at the destination node by the proposed algorithms. 
Second, in order to implement the adaptive algorithms, the 
adjustable code matrix is optimized with the linear receive 
filter iteratively, and then transmitted back to the relay node 
via a feedback channel. The impact of the feedback errors is 
considered and shown in the simulations. Third, the proposed 
fully distributed optimization algorithm eliminates the effect 
of the feedback by choosing the optimal code matrix before 
transmission, and the receiver is released from the design task. 

The paper is organized as follows. Section II introduces 
a two-hop cooperative MIMO system with multiple relays 
applying the AF strategy and the adaptive DSTC scheme. 
In Section III the proposed optimization algorithms for the 
adjustable code matrix are derived, and the pairwise error 
probability is analyzed in Section IV. The fully distributed 
optimization algorithm is derived in Section V, and the results 
of the simulations are given in Section VI. Section VII gives 
the conclusions of the work. 

Notation: the italic, bold lower-case and bold upper-case 
letters denote scalars, vectors and matrices, respectively. The 
operators E[-] and (-) H stand for expected value and the 
Hermitian operator. T he N x N i dentity matrix is written 

as I N . || X \\ F = \jTr(X H ■ X) = jTr(X ■ X H ) is the 
Frobenius norm. 5R[] and 3[] stand for the real part and 
the imaginary part, respectively. Tr(-) stands for the trace 
of a matrix, and (-y for pseudo-inverse, and (££) denotes the 
Kronecker product. 




Fig. 1. Cooperative MIMO system model with n r relay nodes 

II. Cooperative MIMO System Model 

The communication system under consideration is a two- 
hop cooperative MIMO system which employs multiple relay 
nodes as shown in Fig. 1. The first hop is devoted to the source 
transmission, which broadcasts the information symbols to 
the relay nodes and to the destination node. The second hop 
forwards the amplified and re-encoded information symbols 
from the relay nodes to the destination node. An orthogonal 
transmission protocol is considered which requires that the 
source node does not transmit during the time period of the 
second hop. In order to evaluate the adaptive optimization 
algorithms, a BSC is considered as the feedback channel. 

Consider a cooperative MIMO system with n r relay nodes 
that employ an AF cooperative strategy as well as a DSTC 
scheme. All nodes have TV antennas to transmit and re- 
ceive. We consider only one user at the source node in 
our system that operates in a spatial multiplexing configu- 
ration. Let s[i] denotes the transmitted information symbol 
vector at the source node, which contains N parameters, 
s[i] = [si[i] t S2[i],-;Stf[i]], and has a covariance matrix 
75[s[i]s ff [i]\ — ct^In, where a 2 s is the signal power which 
we assume to be equal to 1. The source node broadcasts s[i] 
from the source to n r relay nodes as well as to the destination 
node in the first hop, which can be described by 

r S D[i] = HsD[i]s[i}+n S D[i], r SRk [i\ = F SRk {i]s{i\+n SRk {i], 

(1) 
i = 1,2, ... ,N, k = 1,2, ... n r , 

where rsR k [i] and r,g£>[z] denote the received symbol vectors 
at the kth relay node and at the destination node, respectively. 
The N x 1 vector risR k [i] and Usd [i] denote the zero mean 
complex circular symmetric additive white Gaussian noise 
(AWGN) vector generated at the kth relay node and at the 
destination node with variance a 1 . The matrices F$R k [i] and 
H sd [i] are the N x N channel coefficient matrices between 
the source node and the kth relay node, and between the source 
node and the destination node, respectively. 

The received symbols are amplified and re-encoded at each 
relay node prior to transmission to the destination node in the 
second hop. We assume that the synchronization at each node 
is perfect. After amplifying the received vector rsn k [i] at the 
kth relay node, the signal vector ssR k [i] = A RkD [i]r S R k [i] 
can be obtained, where A Rk £, [i] stands for the NxN diagonal 
amplification matrix assigned at the fcth relay node. The JVxl 
signal vector ssR k M will be re-encoded by an N x T DSTC 
scheme M(s), multiplied by an N x N adjustable code matrix 



*&k[i] generated randomly 0121 . and then forwarded to the 
destination node. The relationship between the fcth relay and 
the destination node can be described as 

R RkD [i\ = G RkD [i\^ k [i\M RkD [{\ + N RkD [i\. (2) 

The N x T received symbol matrix R Rk n\i\ in © can be 
written as an NT x 1 vector r Rk u[i] given by 



r Rk n[i\ = ®eq k \i}G eq .[i]s SRk [i] + n RkD [i], 



(3) 



where the block diagonal NT x NT matrix 3> eqk [i] denotes 
the equivalent adjustable code matrix and the NT x N 
matrix G eqk [i] stands for the equivalent channel matrix which 
is the DSTC scheme Af(s[i]) combined with the channel 
matrix G Rk r>[i\. The NT x 1 equivalent noise vector n Rk o[i] 
generated at the destination node contains the noise parameters 
in N RkD [i\. 

The use of an adjustable code matrix or a randomized 
matrix & eqk [i] which achieves the full diversity order and 
provides a lower error probability has been discussed in [1121 . 
The uniform sphere randomized matrix which achieves the 
lowest BER of the analyzed schemes and contains elements 
that are uniformly distributed on the surface of a complex 
hyper-sphere of radius p is used in our system. The proposed 
adaptive algorithms detailed in the next section optimize the 
code matrices employed at the relay nodes in order to achieve 
a lower BER. At each relay node, the adjustable code matrices 
are normalized so that no increase in the energy is introduced 
at the relay nodes and the comparison between different 
schemes is fair. 

After rewriting R Rk n[i] we can consider the received 
symbol vector at the destination node as a (T + l)iV x 1 
vector with two parts, one is from the source node and another 
one is the superposition of the received vectors from each 
relay node. Therefore, the received symbol vector for the 
cooperative MIMO system can be written as 



r\i\ = 



Hs D [i]s[i] 

\i}Geq k [i]s SRk [i\ 

D D [i]s D [i] +n D [i], 



Hk=l *eq fc 



nsD [i] 

TIrdU] 



(4) 



where the (T + 1)N x 2A^ block diagonal matrix Do[i] 
denotes the channel gain matrix of all the links in the network 
which contains the N x N channel coefficients matrix H$d [i] 
between the source node and the destination node, the NTxN 
equivalent channel matrix G eqk [i] for k = 1, 2, ..., n r between 
each relay node and the destination node. We assume that 
the coefficients in all channel matrices are independent and 
remain constant over the transmission. The (T + l)iV x 1 
noise vector tid[*| contains the equivalent received noise 
vector at the destination node, which can be modeled as 
an AWGN with zero mean and covariance matrix o~ 2 (l+ | 
Efc=i $eq k \i}G eqk [i]A RkD \i] \\ 2 F )I {T+1)N . 

III. Joint Adaptive Code Matrix Optimization and 
Receiver Design 

In this section, we jointly design an MMSE adjustable 
code matrix and the receiver for the proposed DSTC scheme. 
Adaptive SG and RLS algorithms ll20l for determining the 



parameters of the adjustable code matrix with reduced com- 
plexity are also devised. The DSTC scheme used at the 
relay node employs an MMSE-based adjustable code matrix, 
which is computed at the destination node and obtained by a 
feedback channel in order to process the data symbols prior 
to transmission to the destination node. It is worth to mention 
that the code matrices are only used at the relay node so the 
direct link from the source node to the destination node is not 
considered in this section. 

A. Linear MMSE Receiver Design with Adaptive DSTC Opti- 
mization 

The linear MMSE receiver design with optimal code matri- 
ces is derived as follows. By defining the TN x 1 parameter 
vector Wj[i] to determine the jth symbol Sj[i], we propose 
the MSE based optimization with a power constraint at the 
destination node described by 



■ e«3fc 



i]] = arg min E [\\ Sj \i] - wf [i]r[i]\\ 2 ] , s.t. Tr(* c 



*»:([*] >*egt.[' 



(5) 

where r[i] denotes the received symbol vector at the desti- 
nation node. By employing a Lagrange multiplier A we can 
obtain the Lagrange expression shown as 

if = J B[|| Sj H-<[z]r[z]|| 2 ]+A(Tr(* oqk [i]*f qk [i])-P R ). 

(6) 

By expanding the right-hand side of © and taking the 

gradient with respect to w* [i] and equating the terms to zero, 

we can obtain the jth MMSE receive filter vector for the jth 

symbol 

w j [i]=R- 1 p, (7) 

where the first term R = E [r[i]r H [z]] denotes the auto- 
correlation matrix and the second term p = E [r[i]s*[i]\ 
stands for the cross-correlation vector. To optimize the code 
matrix 3> e9fc ,[i] for each symbol at each relay node, we can 
calculate the code matrix by taking the gradient with respect 
to 3?L fc [*] an d equating the terms to zero, resulting in 



■eq kj 



R 1 P, 



where R = E 



sMssn^^Wjiilw^i] + XI 



(8) 



and P = 



E 



s jWs Rki [i]w 3 [i\g% k 



value of the Lagrange multip 



are NT x NT matrices. The 
ier A can be determined by sub- 
stituting & eqk . [i] into ATr($ eqk [i]*f qk [i]) = P R and solving 
the power constraint function. In the proposed adaptive algo- 
rithm we employ quantization instead of using the Lagrange 
multiplier, which requires less computational complexity. The 
detailed explanation is shown in the next section. Note that 
non-linear detection algorithms J26l can also be employed at 
the receiver for an improved performance. 

Appendix A includes a detailed derivation of Wj[i] and 
$ eg . [i]. The power constraint can be enforced by employ- 
ing the Lagrange multiplier and by substituting the power 
constraint into the MSE cost function. In (J8j a closed-form 
expression of the code matrix 3> eqk [i] assigned for the jth 
received symbol at the fcth relay node is derived. The problem 
is that the optimization method requires the calculation of 



TABLE I 
Summary of the C-ARMO SG Algorithm 



1 
2 
3 
4 

5 
6 
7 

8 
9 

10: 



Initialize: Wj [0] = 



JVTxl, 



According to ( [10) . the receive filter Wj[i] and the code 
depend on each other. Therefore, alternating 
can be used to determine the 



matrix <I? 



'■'II' : 



optimization algorithms 



<£• [0] is generated randomly with the power constraint Tri(*fe^ k ffirajSji<rePeiye filter and the code matrix iteratively, and 
For each instant of time, i=l, 2, ..., comphfceoptimization procedure can be completed. The complexity 

of calculating the optimal Wj[i] and & eqk .[i] is O(NT) and 
0{N 2 T 2 ), respectively, which is much less than 0(N 4 T 4 ) 
and 0(N 5 T 5 ) by using © and (HJ. As mentioned in Section 
I, the optimal MMSE code matrices will be sent back to the 
relay nodes via a feedback channel, and the influence of the 
feet feedback is shown and discussed in simulations. 



* 



VJSf„.| 

J ' 

where 6j[i] 
Update Wj 

Wj [i + 1 
[i + 11 = * 



-ej[ijr[ij 



}\Sj[l\Wj 



Wj\i\r\i] 



and ®eq k] [l\ 



by 



Wi I 



eik. 



p(e 



I'll, 



[i + l] = 



/P^*e 



v jL J / u ' lmper: 



m 



Ji+l] 



/Ef=i Tr(# C q k . [i+i]*? qk . [i-£L])AfL Detection and LS Code Matrix Estimation Algorithm 



a matrix inversion with a high computational complexity of 
0((iVT) 3 ), and with the increase in the number of antennas 
employed at each node or the use of more complicated STC 
encoders at the relay nodes, the computational complexity 
increases cubically according to the matrix sizes in (|7) and 
®. 

B. Adaptive Stochastic Gradient Optimization Algorithm 

In order to reduce the computational complexity and achieve 
an optimal performance, a centralized adaptive robust matrix 
optimization (C-ARMO) algorithm based on an SG algorithm 
with a linear receiver design is proposed as follows. 

The Lagrangian resulting from the optimization problem is 
derived in ©, and a simple adaptive algorithm for determining 
the linear receive filters and the code matrices can be derived 
by taking the instantaneous gradient term of (|6) with respect 
to w* [i] and with respect to <fr* [i], respectively, which are 



The criterion for optimizing the adjustable code matrices 
and performing symbol detection in the C-ARMO algorithm 
can be changed to the maximum likelihood (ML) criterion, 
which is equivalent to a Least-squares (LS) criterion in this 
case. For example, if we take the ML instead of the MSE 
criterion to determine the code matrices, then we have to store 
an N x D matrix S at the destination node which contains all 
the possible combinations of the transmitted symbol vectors. 
The ML optimization problem can be written as 



&,[*],* 



e-lk. 



= arg 



mm 



['I 



|rW-rH|| 2 , S .i.Tr($ cqk [i]*f qk [i] 



VJ2f». 

"Ik 

where Cj [i] = Sj [i 
and the NT X 1 



= VE[\\ Sj \ 
= VE[\\ Sj \ 



— Wj [i\r[i\ 

— wf[i]r[i] 



-e 5 [i\r[i\ 



(9) 
— to^[i]r[i] stands for the jth error signal, 
vector dk [i] denotes the jth column of 
the channel matrix which contains the product of the channel 
matrices FsR, k and Gn k D and the power allocation matrices 
AR k D- After we obtain (0 the proposed algorithm can be 
obtained by introducing a step size into a gradient optimization 
algorithm to update the result until the convergence is reached, 
and the algorithm is given by 



(11) 
where f[i] = YZ=x £j=i *e gtj [i]d kj [i]s dj [i] denotes the 
received symbol vector without noise which is determined by 
substituting each column of S into ([Til l. It is worth to mention 
that the optimization algorithm contains a discrete part which 
refers to the ML detection and a continuous part which refers 
to the optimization of the code matrix, and the detection and 
the optimization can be implemented separately as they do 
not depend on each other. The optimization algorithm can be 
■ur^rjs^feSd as a mixed discrete-continues optimization. In this 
case, other detectors such as sphere decoders can be used in 
the optimization algorithm in the detection part in order to 
reduce the computational complexity without an impact on 
the performance, and the algorithm will converge after several 
iterations. 

After determining the transmitted symbol vector, we can 
calculate the optimal code matrix $ 



i-'ik . 



by employing the 
LS estimation algorithm. The Lagrangian expression is given 
by 



[*]+/8(e;[t]r[*]), 



*, 



A' 



where /3 and p denote the step sizes in the recursions for the 
estimation. A detailed derivation is included in Appendix B. 
The energy of the code matrices in (TTOl will be increased 
with the processing of the adaptive algorithm, which will 
contribute to the reduction of the error probability. A normal- 
ization of the code matrix after the optimization is required and 

implemented as v eqk . [i + 1J 



[*+l] = ®e. qk] [i]+li{ej[i]^[i^Mdltm^& 

do) VUK hh 



eq h Ai\d ki [{\s d:j [i\ 



-A(rr[* egt [i]* 



■ elk I 



(12) 

and by taking the instantaneous gradient of _S? with respect to 
the code matrix 4>* [i] we can obtain 



VJSfa 



{'] 



(r\i] 

(r ej \ 



r[»])V S| 



■eqfc. 



■Ik 

i]d 



i\(r[i] 

kAi]sd, 






[i+i]*f [i+i]) 



^Ef =1 Tr(# oqkj 

to ensure that the energy is not increased and for a fair 
comparison among the analyzed DSTC schemes. A summary 
of the C-ARMO SG algorithm is given in Table I. 



[«D, 

(13) 

r Ej L*J n*j - Efe=i EfcUiyy ^^k t [<]dk, [i]sd, [*] 
stands for the received vector without the desired code matrix. 



where 



The optimal code matrix 4> egfc [i] requires V^f$ 



0, 



and the optimal adjustable code matrix as given by 



TABLE II 

Summary of the C-ARMO RLS Algorithm 



*, 



W<M 



SdAt 



[*]| 2 dfc,[t]d?[t])t. (14) 



The power constraint is not considered because the quanti- 
zation method can be employed in order to reduce the high 
computational complexity for determining the value of the 
Lagrange multiplier. 



D. RLS Code Matrix Estimation Algorithm 

The RLS estimation algorithm for the code matrix & e q k [i] 
is derived in this section. The ML detector is employed so 
that the detection and the optimization procedures are separate 
as explained in the previous section, so we focus on how 
to optimize the code matrix rather than the detection. The 
superior convergence behavior of the LS algorithm when the 
size of the adjustable code matrix is large indicates the reason 
of the utilization of an RLS estimation, and it is worth to 
mention that the computational complexity reduces from cubic 
to square by employing the RLS algorithm. 

According to the RLS algorithm, the optimization problem 
is given by 



1 
2 
3 
4 

5 

6 

7 
8 

12 



Initialize: P[0] — S~ l Intxnt, Z[0] = Intxni 
the value of S is small when SNR is high and is large when 
$[0] is generated randomly with the power constraint trace(4> C q k 
For each instant of time, i=l, 2, ..., compute 

K W ~ i+a-i^.w*- 1 ^-^^.^]' 
* eqkj [*] = * e9fcj [i - 1] + A- Ve[i] - Z[i - l]fc[i])r£. [*] 
P[i] = A _1 P[i - 1] - A-^^rf, [i]P[i - 1], 



*, 



Z[i] = \Z[i 



l\+r e [i\r\ 



I ■ 



VE^xTrt^^H^.ti])- 



by 

* eqk . [i] = \Z[z - l]P\i] + rMrl \i]P\i] 

= Z[i - l]P[i - 1] + Z[i - l]k\i)r%. \i]P[i - 1] + r e [t]r£ [%\P 

= * eg [i- 1] + \-\r e [i\ - Z[i - l]k\i])r%\i]P[i - 1]. 

(21) 
Table II shows a summary of the C-ARMO RLS algorithm. 



E. Convergence Analysis 

The C-ARMO algorithms can be divided into two cases: 
the first one performs the optimization by updating the receive 



[^ [*]] = arg _nrfn n £ A^||r[n]-f [i\f, s.t. ^(*c*[i]*g^^£ ™* ^^ U ' 



['] 



(15) 
where A stands for the forgetting factor. By expanding the 
right-hand side of (fl31 l and taking gradient with respect to 
3>* [i] and equaling the terms to zero, we obtain 



*e 9fcj [t] = (]T A^eHrf . [n])(£ A*" w r fci [n]r£, [n])" 1 , 

n— 1 2—1 

(16) 
where the 7VT x 1 vector r e [n] = <fr eqk .[n]dk[n]sd[n] 
and rjfe. [n] = djb[n]s<j. [n]. The power constraint is still not 
considered during the optimization. We define 

i 

*[*] = E Ai ""^ M< N = A*[* - 1] + r kj [n]r H k] [n], 

n=l 

(17) 

i 

Z[i] = Y J ^ n re[n]r H kj [n] = AZ[i-l]+r e [n]rf.[n], (18) 

n=l 

so that we can rewrite (fTol as 

$ e% .[z] = Z[z]*- 1 [i]. (19) 

By employing the matrix inversion lemma in 11301 , we can 
obtain 

* _1 [i] = A _1 * _1 [i - 1] - A -1 Jfe[i]rj^ [*]* _1 [* - 1], (20) 

where k[i] = (A -1 * -1 ^ - l]r fc ,[i])/(l + A _1 r^[i]* -1 [i- 
l]r^[i]). We define P[i] = * _1 [i] and by substituting ( fT8l 
and ( f20b into ( fT9l , the expression of the code matrix is given 



second one only optimizes the code 
matrix itself according to the Lagrangian function, i.e., the 
ML and RLS based C-ARMO algorithms. In this subsection, 
we will illustrate how the C-ARMO algorithms converge to 
the global optimum solution. 

1) MSE based C-ARMO algorithm: The proposed MSE 
based C-ARMO algorithm allows the optimization of the 
receive filter w[i] and the code matrix <&[i] iteratively. A 
detailed proof of the convergence of this type of algorithm 
is derived in 11321 . We will give a brief outline on how these 
results can be used to prove the convergence of our algorithms. 

According to 1321 . the optimization problem in (0 can be 
described as: Given an initial (wq, 3?o) £ Wo x Vq, we have 
to find a sequence of points (w n , $„) € W n x V n that 



lim j2f(to n) * n )=J&f(W,7>), 



(22) 



where the sequence of compact sets {(W n ,'P n )}n>o ■ Wnj'Pn 
that are revealed at time n such that as n — > oo, V n -? V and 
W n H W, and 

dniA, B) = maxjsup inf d(A, B), sup inf d(A, B)\ 

AeA B( ^ B BeB A ^A 

(23) 
denotes the Hausdorff distance between A and B. The pro- 
posed algorithm is written recursively for n > 1 as described 

by 

w n e arg min jSf(«j, * n _i), *„ € arg min Jz? («>„,<&). 

(24) 
According to the three-point property and the four-point prop- 



erty in 1021 , we can obtain 

j2f(to„, *„)+jSf(ti>, *„) < if (to, *„_i)+jSf(w, *)+w(7n) 

(25) 
for all w G W„ and & £ V n , where w(7„) denotes the 
modulus of continuity of j£f (tu n , <& n ) with w(7 n ) — > as 
7„ — > 0, and j n = s n + e„_i with e„ — > as n — >• oo. 

Since V n -$V and W„ -? W, there must exists a sequence 
(w' n ,& n ) G W„ x -P„ such that (w' n ,& n ) -> (to',*') G 
argmin if (W,-P) and d(u^,u>') + d(*' re) *') < e„ for all 
n > 0. By replacing (w,<&) with («?',$') and choosing 
if (toj,, *JJ < if (ti/, $') + w(s„), we can obtain 



in order to concentrate on the effects of the adjustable code 
matrix on the performance. The expression of the upper bound 
holds for systems with different sizes and an arbitrary number 
of relay nodes. 

Consider an TV x N STC scheme at the relay node with T 
codewords, and the codeword C 1 is transmitted and decoded 
as another codeword C l at the destination node, where i = 
1, 2, ..., T. According to J2T|, the probability of error for this 
code can be upper bounded by the sum of all the probabilities 
of incorrect decoding, which is given by 



&{w n , & n )+J?(w' n , * n ) < ^ «_!, *„_!)+_£?(«/, ^')+2u( ln )+uj(e n ) 1 

(26) 
and further derive 



p c <j2 p ( cl ^ ct y 



(30) 



limirrf if (>„, * n ) < Jf(w, *). 



(27) 



By defining a subsequence {rik}k>o such that 
liminfn-Kx, if («>„,*„) = lim^oo S£ (w nk , *„ J, and 
assuming compactness of W and V, we can obtain w G W , 
$eP, and 



lim J2f(t»„ fc> * n j> if («;,*). 

(28) 
obtain 



lim inf if (w n ,* n ) 

Combining ( |27] | and (1281 1. we can 

lim infn^oo if (w n , $„) = if(^(;,4 , ) which indicates 

(l22l converges to the optimum values. 

2 J ML and RLS based C-ARMO algorithm: The ML and 
RLS based C-ARMO algorithms just optimize the code matrix, 
and we can analyze the Hessian matrix of ( fT2l and check its 
positive (semi-)definiteness. By taking the second-order partial 
derivatives of the Lagrangian cost function in (fT2l . we can 
obtain 

8 r) V r) 



d® c 



lfl *e, fc J* 



a* e 






where the first term 



(29) 
| 2 is a positive scalar and the 



rest of the terms denotes the multiplication of the equivalent 
channel vectors which is a positive-definite matrix and the 
problem is convex. We conclude that the Hessian matrix of 
the Lagrangian cost function is a positive-definite matrix so 
that the ML and RLS based C-ARMO algorithms converge to 
the global optimum under the usual assumptions used to prove 
the convergence of these algorithms for convex problems. 

IV. Probability of Error Analysis 

In this section, the pairwise error probability (PEP) of the 
system employing the adaptive DSTC will be derived. As we 
mentioned in Section I, the adjustable code matrices will be 
considered in the derivation as it affects the performance by 
reducing the upper bound of the pairwise error probability. The 
PEP upper bound of the traditional STC schemes in lETI is 
introduced for comparison, and the main difference lies in the 
eigenvalues of the adjustable code matrices. Please note that 
the direct link is ignored in the PEP upper bound derivation 



Assuming that the codeword C 2 is decoded at the destination 
node and that we know the channel information perfectly, we 
can derive the conditional pairwise error probability of the 
DSTC encoded with the adjustable code matrix $ as ll33ll 

PIC 1 -► C 2 I *) = Q Ul || *£>(C 1 - C 2 ) || F ) , (31) 

where D stands for the matrix with the channel coefficients 
for all links. Let U H AcU be the eigenvalue decomposition 
of (C 1 - C 2 ) W (C 1 - C 2 ), where U is a unitary matrix 
with the eigenvectors and Ac is a diagonal matrix which 
contains all the eigenvalues of the difference between two 
different codewords C and C . Let V A&V stand for the 
eigenvalue decomposition of (<&DU) H &DU, where V is 
a unitary matrix that contains the eigenvectors and A# is a 
diagonal matrix with the eigenvalues arranged in decreasing 
order. Therefore, the conditional pairwise probability of error 
can be written as 
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i^r*e 9fcJ [*H i [*1<[t]x\ 

'(32) 

where £ nm is the (n, m)th element in V, and A* n and Xc„ 
are the nth eigenvalues in A$ and A s , respectively. It is 
important to note that the value of A# is positive and real 
because (&DU) H $>DU is Hermitian symmetric. According 
to I2TI . an appropriate upper bound assumption of the Q 
function is Q(x) < \e~^~ , thus we can derive the upper bound 
of the pairwise error probability for an adaptive STC scheme 



Pe* <E 



r/Y 



/ NT N N 

2 exp -JEE a *" a cJ^,™i 2 , 

Z V m=l„=l )\ lL=l( 1 +4 A *- / 

(33) 

while the upper bound of the error probability expression for 
a traditional STC in [21] is given by 

NT N \ "I 

EE Ac "l^™l 2 = n w n + n^ ) NT ' 

(34) 

If we neglect the 1 in the denominator in (|33l , the exponent 
of the SNR 7 indicates the diversity order which means the 
full diversity NTN can be achieved in ( T33T >. By comparing 
d33l and d34b . employing an adjustable code matrix for an 



Pe D <E 



■ cxp 



2 

4 



STC scheme at the relay node introduces A# n in the BER 
upper bound. With the aid of simulations, we found that A# 
is diagonal with one eigenvalue less than 1 and others much 
greater than 1. As a result, employing the adjustable code 
matrices can provide a decrease in the BER upper bound since 
the value in the denominator increases. 



V. The fully distributed adaptive robust matrix 

OPTIMIZATION ALGORITHM 



Inspired by the analysis developed in the previous section, 
we derive a fully distributed ARMO (FD-ARMO) algorithm 
which does not require the feedback channel in this section. 
We will extend the exact PEP expression in |J29] for MIMO 
communication systems to the AF cooperative MIMO systems 
with the adaptive DSTC schemes. Then, we design the FD- 
ARMO algorithm to determine and store the adjustable code 
matrices at the relay nodes before the transmission in Phase 
II. 

The exact PEP expression of an STC has been given by 
Taricco and Biglieri in ||29l , which contains the sum of the 
real part and the imaginary part of the mean value of the 
error probability, and the moment generating function (MGF) 
is employed to compute the mean value. To extend the exact 
PEP expression to the cooperative MIMO systems, we can 
rewrite the received symbol vector at the destination node as 

R RD = J2 *k[i]D k [i]C[i] + N RD [i\, 
fc=i 

where Dk[i] denotes the channel matrix. For simplicity, we 
assume the synchronization is perfect, and each relay node 
transmits the STC matrix simultaneously and the received 
symbol vector at the destination node will be the superposition 
of each column of each STC code. The equivalent noise vector 
contains the AWGN at the destination node as well as the 
amplified and re-encoded noise vectors at the relay nodes. As a 
result the PEP expression of the AF cooperative MIMO system 
with the adaptive DSTC can be derived as 

$D{C l C 2 ) || 



P(C X -> c 2 1 * e 



Q 



V2N~o 



(35) 



where Nq 



Tr(J + 4>D) denotes the received noise 



variance at the destination node. We define AG 

C as the distance between the code words, and r = 

v <&DAA H D H <& H and we assume that the eigenvalue 

decomposition of AA B can be written as VAV H , where V 
stands for a unitary matrix that contains the eigenvectors of 
AA and A contains all the eigenvalues of the square of 
the distance vector. Define an N x N matrix Z = &D, 
and Z ~ N c (fj,z>^z), where fxz = denotes the mean 
and Hz = E [SzS^] stands for the covariance matrix. The 



TABLE III 
Summary of the FD-ARMO Algorithm 



Choose the N x T STC scheme used at the relay node 
Determine the dimension of the adjustable code matrix <I> which is 
Compute the eigenvalue decomposition of AA ff and store the res 
Generate a set of 4> randomly with the power constraint Tr(<&k^i 

For all <I>, compute 



9(c) = dct I 



= *A*' 



6: 



Choose the code matrix according to 
©qpt(c) = argmax; 0j(c) 
7: Store the optimal code matrix & op t at the relay node 

expression of the error probability is given by 

[$DAA ff D H * H ]) 



6(c) = E [exp(-c£)] =E 



E 



exp 



2N 



exp(— c 

[ZAZ H ] 
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2ivy 



exp(-n%B(I + Z z B) 



V>3. 



dct [I 



2^/2No 



$A$« 



det I 



= *A*' 



2 v / 2Ao 

(36) 

where B = I (£) AA ff , and c = a + jb is the variable defined 

in the MGF with a = \ and b is a constant. By inserting (l36*l l 

into the pairwise error probability expression in [E9l . we can 

obtain the exact PEP of the adaptive DSTC scheme written as 

1 J h 

t=i 
where Ej — > as J — > oo. 

Since the PEP is proportional to (l36l . it is clear that 
minimizing the PEP is equal to maximizing the determinant 
<I>A<I> W . As a result, the optimization problem 



of I 



2\/2JVc 

can be written as 



Q op t{c) = argmax6/(c), 



1 = 1,2,... 



(38) 



where 0/(c) stands for the Zth candidate code matrix. For 
simplicity the candidate code matrices are generated randomly 
and satisfy the power constraint. In order to obtain the ad- 
justable code matrix we can first randomly generate a set of 
matrices, and then substitute them into (l36l l to compute the 
determinant. In the simulation, we randomly generate 500 code 
matrices and choose the optimal one according to the FD- 
ARMO algorithm. The optimal code matrix with the largest 
value of the determinant which achieves the minimal PEP 
will be employed at the relay node. A summary of the FD- 
ARMO is given in Table III. It is worth to mention that the 
FD-ARMO algorithm is non-convex, but it can still achieve 
the optimal performance by choosing the optimal code matrix 
from a number of candidates even though this not guaranteed. 

VI. Simulations 

The simulation results are provided in this section to assess 
the proposed scheme and algorithms. The cooperative MIMO 
system considered employs an AF protocol with the Alamouti 
STBC scheme in J5) using QPSK modulation in a quasi-static 
block fading channel with AWGN. The effect of the direct link 




© - ! .IMi" bound III I) AI.iI'ii.Ii 
+ - IJ|i3er bound 1:1 H AlHIllouli 

- Ufioei bound tit C ARMQ Allium.. 

D Alamouti ML 

R Alamouti ML 

C AHVIO Ad'iiouli m 




SM-linear MMSE 
D-Alamauti-linear MMSE[26] 

SSD PDST8C li'iHMf MMSE|37] 
R Alamou^ ijnea'UMSE[l2] 
C Ai'Vio Alamouti SG meat MMSE 

lull 0:i|>:i - ;.i"ist<: AlaivouN i-h;« VIVISI 7 



Fig. 2. BER Performance vs. SNR for the Upper Bound of the Alamouti 
Schemes without the Direct Link 

is also considered. It is possible to employ the DF protocol or 
use different number of antennas and relay nodes with a simple 
modification. The system is equipped with n r = 1 relay node 
and N = 2 antennas at each node. In the simulations, we set 
the symbol power 07 as equal to 1 , and the power of the ad- 
justable code matrix in the ARMO algorithms are normalized. 
The SNR in the simulations is the received £NR which is 
calculated by SNR = ,, n r«}¥= 1 n'A k ®n{ F rW^- 

The upper bounds of the D-Alamouti, the R-Alamouti 
in H2 and the adaptive Alamouti STC in C-ARMO RLS 
algorithm are shown in Fig. 2. The theoretical pairwise error 
probabilities provide the largest decoding errors of the three 
different coding schemes and as shown in the figure, by 
employing a randomized matrix at the relay node it decreases 
the decoding error upper bound. The bounds become tighter to 
the respective coding schemes as the SNR increases. The com- 
parison of the simulation results in a better BER performance 
of the R-Alamouti and the D-Alamouti which indicates the 
advantage of using the randomized matrix at relay nodes. The 
C-ARMO RLS algorithm optimizes the randomized matrices 
after each transmission which contributes to a lower error 
probability upper bound, and the ML detection algorithm 
provides the optimal performance at the cost of a higher 
computation complexity. 

The proposed C-ARMO SG algorithm with a linear MMSE 
receiver is compared with the SM scheme and the DSTC 
algorithms in El, El, El and in Fi g- 3. It is worth 
to mention that the coding schemes in the simulations are 
different. In the proposed algorithm and the algorithms in 
l34l . ll35ll and lfT2l . a spatial multiplexing scheme is sent from 
the source node and re-encoded at the relay node, while the 
full-opportunistic code (7) requires an STC encoding at the 
source node instead of re-encoding at the relay node. The 
step sizes for the iterative optimization are j3 — 0.01 and 
/i = 0.03, which are chosen according to ll36l . The results 
illustrate that without the direct link, by making use of the 
STC technique, a significant performance improvement can 
be achieved compared to the spatial multiplexing system. The 
RSTC algorithm in lfl2l outperforms the STC-AF schemes 
in El and El, while the C-ARMO SG algorithm can 
improve the performance by about 3dB as compared to the 
RSTC algorithm. The STC scheme in Q achieves a much 
better performance compared to other schemes although this 
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Fig. 4. BER Performance vs. SNR for C-ARMO RLS Algorithm with and 
without the Direct Link 

comparison must be considered with caution. In Q the 
standard 2x2 Alamouti STBC is employed at the source 
node, which indicates the received matrix at the relay node 
is amplified without the interference to the orthogonality of 
the code. Moreover, encoding at the source node requires 
more time slots to transmit so that the transmission rate is 
half compared to the proposed C-ARMO algorithm. It is 
also worth to mention that the C-ARMO algorithm can be 
employed in an opportunistic scheme to achieve a better BER 
performance as both of the algorithms employ the STCs and 
can perform the optimization at the destination node. With the 
consideration of the direct link, the results indicate that the 
diversity order can be increased, and using the C-ARMO SG 
algorithm an improved performance is achieved with 2dB of 
gain as compared to employing the RSTC algorithm in lfT2l 
and 3dB of gain as compared to employing the traditional 
STC-AF algorithm in El- 

In Fig. 4, BER curves of different Alamouti coding schemes 
and the proposed C-ARMO RLS algorithm with and with- 
out the direct link using an ML detector are compared. In 
Fig. 4, the R-Alamouti scheme improves the performance 
by about 4dB without the direct link compared to the D- 
Alamouti scheme, and the C-ARMO RLS algorithm provides 
a significant improvement in terms of gains compared to the 
other DSTC schemes. When the direct link is considered, 
all the coding schemes can achieve the full diversity order 
and obtain lower BER performances compared to that without 
the direct link, and still the C-ARMO RLS algorithm which 
optimizes the adjustable code matrix achieves the lowest BER 
performance. 
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Fig. 5. BER Performance vs. Number of Samples for C-ARMO SG 
Algorithm without the Direct Link 

The simulation results shown in Fig. 5 illustrate the con- 
vergence property of the C-ARMO SG algorithm. All the 
schemes have an error probability of 1 at the beginning, and 
after the first 20 symbols are received and detected, the R- 
Alamouti scheme in Il34l achieves a better BER performance 
compared with the spatial multiplexing scheme and the R- 
Alamouti scheme in lfl2l can reach a lower BER than the 
C-ARMO algorithm. With the number of received symbol 
increasing, the BER curve of the spatial multiplexing, the D- 
Alamouti and the R-Alamouti schemes are almost straight, 
while the BER performance of the C-ARMO algorithm can be 
further improved and obtain a fast convergence after receiving 
140 symbols. 

The simulation results shown in Fig. 6 illustrate the influ- 
ence of the feedback channel on the C-ARMO SG algorithm. 
As mentioned in Section I, the optimized code matrix will 
be sent back to each relay node through a feedback channel. 
The quantization and feedback errors are not considered in 
the simulation results in Fig. 3 and Fig. 4, so the optimized 
code matrix is perfectly known at the relay node after the 
C-ARMO algorithm; while in Fig. 6, it indicates that the 
performance of the proposed algorithm will be affected by 
the accuracy of the feedback information. In the simulation, 
we use 4 bits to quantize the real part and the imaginary 
part of the element in the code matrix Q eqk \i\, and the 
feedback channel is modeled as a binary symmetric channel 
with different error probabilities. As we can see from Fig. 6, 
by decreasing the error probabilities for the feedback channel 
with fixed quantization bits, the BER performance approaches 
the performance with the perfect feedback, and by making 
use of 4 quantization bits for the real and imaginary part of 
each parameter in the code matrix, the performance of the C- 
ARMO SG algorithm is about ldB worse with feedback error 
probability of 10~ 3 . 

In Fig. 7, we plot the average error probability with respect 
to the SNR for the FD-ARMO algorithm and the C-ARMO 
SG algorithm with perfect feedback. The C-ARMO curve 
and the FD-ARMO curve outperforms the others because 
they optimize the adjustable code matrices with the same 
criterion, but ldB of gain has been obtained by the C-ARMO 
SG algorithm because the exact adjustable code matrix is 
transmitted back to the relay node in a delay-free and error- free 
feedback channel. While the FD-ARMO chooses the optimal 



Fig. 6. BFR Performance vs. SNR for C-ARMO Algorithm with Perfect 
and Im 




Fig. 7. Full-Distributed ARMO Algorithm and C-ARMO SG Algorithm 

adjustable code matrix by using the statistical information of 
the channel before transmission so that the performance will 
be influenced, resulting in a gain less than ldB. 

VII. Conclusion 

We have proposed centralized adaptive robust matrix opti- 
mization (C-ARMO) algorithms for the AF cooperative MIMO 
system using a linear MMSE receive filter and an ML receiver 
at the destination node. The pairwise error probability of in- 
troducing the adaptive DSTC in a cooperative MIMO network 
with the AF protocol has been derived. In order to eliminate 
the need for a feedback channel we have derived a fully- 
distributed ARMO (FD-ARMO) algorithm which can achieve 
a similar coding gain without the feedback as compared to 
the C-ARMO algorithms. The simulation results illustrate the 
advantage of the proposed ARMO algorithms by comparing 
them with the cooperative network employing the traditional 
DSTC scheme and the RSTC scheme. The proposed algo- 
rithms can be used with different DSTC schemes using the 
AF strategy and can also be extended to the DF cooperation 
protocol. 

Appendix A 

We show how to obtain the expression of the linear MMSE 
receive filter Wj [i] and the adjustable code matrix *& e g fc [*] in 
equation (0 and (O in Section III in the following. 

The MSE optimization problem is given by 
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We define a cost function associated with the optimization symbol vector r Rk o transmitted from the fcth relay node, 
problem above and expand it as follows By employing the AF cooperative strategy and space-time 

jgp - E rila-ffl - w H \i]r\i]\\ 2 ] + AfTrf* fil$ w [ill - P R ) C ° ding schemes at ±e rela y node ' the received s y mbo1 vector 

js- - -o [\\s 3 m w j l*j*-l*jh j + Aun*cq k [ij* oqk wj r R j at the relay nodes win be amplified and re . encoded prior t0 

= E [*i [*]*$[*]] - t»f [*]£ [r[i]*$[t]] - B [sj[i]r H [i]] «>,•[*] Hb«i§#^rM^]tHe^stination node. Let us first define the 

+ A(Tr(* oqk [i]*« k [i])-P R ), 



(39) 

where A stands for the Lagrange multiplier and should be 
determined before the calculation. It is worth to notice that 
the first, the third and the fifth terms are not functions of 
w^[i], so by taking the gradient of ££ with respect to w^[i] 
and equating the terms to 0, we can obtain 



amplified symbol vector before re-encoding as 

s SRk [i] = A RkD \i](F S R k [i]s[i] + n SRk [i]) = A RkD [i\F SRk [i]a[i] 

= F Rk \i}s[i\+n Rk \i], 

(43) 



+ A, 






-S[r[i]s*[i]] +E[r[i\r H [i\] Wj 



where Afl fc .D[i] denotes the N x N amplify matrix at the fcth 

relay node. The symbol vector ss Rk [i] will be mapped to an 

N x T space-time code matrix M(s), and multiplied by an 

0. (40) adjustable code matrix which is generated randomly before 

being forwarded to the destination node. By substituting (l43l ) 
By moving the first term in gD) to the right-hand side and by Mq ^ ^ relationship between all the relay nodes and the 

multiplying the inverse of the auto-correlation of the received destination node can be written as 

symbol vector, we obtain the expression of the linear MMSE 

receive filter as Wj[i] = R p, where the auto-correlation 

matrix R= E \r[i]r H [i\\ and the cross-correlation vector p = 

E [r\i}s*\i]]. 

In order to obtain the expression of the adjustable code 

matrix & eqk [i] we have to rewrite the received symbol vector 

r\i] as (44) 



r R D 



fe=i 



}G e q k \i](F Rk \i]s[i\ + n Rk [»]) + n RD [i] = J^ *e 9fc [*] 



fe=i 



fe=lj=l 



where the NT x N matrix Dull] contains all the channel 

_r-i_V^<5 i--if~i r-i- ri, r.-i _ V^ V"* <& ri„ inf^mation-ihetween-ithe source node and the fcth relay node, 

~- J k {- 1 r~^ kj e9 arid between the fcth relay node and the destination node. The 

fc=l k=l j = l J 

(41) noise vector at the destination node n_o[iJ is Gaussian with 

where * e ? k . [i] denotes the adjustable code matrix assigned to covariance matrix cr 2 (l + Tr(Y%Li $eq k [i]D k [i}))I N . By 

the jth received symbol S SRk [i] at the fcth relay node, and substituting <|4§ into ©, we can rewrite the MSE optimization 

9eq k H stands for the jth column of the equivalent channel problem as 
matrix G eqk [i] . By substituting d4lT i into d39l ), the expression 



r e<lk 

of _£f can be written as 



4> 



% =E [ Sj [i]s*[i]] - w»[i]E[(r + n RD [i])s*[i]] - E[ S] [i](wf[i\(r + n RD [i])) H ] 

+ E[( W m(f + n RD [i])) H w%Kr + n RD \i])] + A(Tr(* eqk [i]*" [i]) - P R ), 



arg mm 
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N 



where f = J2"k=i Ej=i *e, fcj . [Ag eqk] [i\s S R kj W- We do not 
have to consider the first and the second terms because they 
are not functions of ffr^ [i] so taking the gradient of ££ with 
respect to $* [i] these terms will disappear. The last three 
terms contain the sum of the adjustable code matrices, and we 
focus on the exact jth code matrix we need and consider the 
rest of the sum terms as constants. We can rewrite S£ as 



f [»]-«?[«](x;E*w*] d * 
fe=i j=i 

s.t. Tr(^$ oqkj [i]$« k] [i])<P R . 

j=i 

(45) 



By taking the instantaneous gradient of Jzf in 
respect to wf[i] and 3>^L [i] we can obtain 



with 



££ 
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i]K[#- [i\9 eqk .~ssR ki [i]f] + A(* egfc , m H eqk M^m VE [W S M rfm]\\ 2 ] wm = (*iH - wf[{\r[{\fV w; 



E[(wf[i\& eqk . [i\g eqki \i]s S R kj [Afw" [*]*e, tj . \i]a eqk] WssR k] [i]], 



-ejl'FW 



and by taking the gradient of _S? in 
respect to &* e „ k [i] an d equating the 

obtain $ 



(42) 
d42l with 

terms to 



VJSf* 



= V£ 



jV 



- ™f [*lE E *^ fcj [*]dAy [*]*j[»] + »»«D[*]) 
fc=l j=l 



zero, we can 
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P = E 



■eijfc. 



R-'P, 
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where 
and 
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s A l \ s SR k Ai\Wj[W 



e?fc,- 



Appendix B 



fc=i j=i 



e9 fc? M d fe J W S iW+ n flD 



(46) 



where e_j [*] = Sj[i] — w?[i]r[i] stands for the jth detected 
We show the detailed derivation of the C-ARMO SG error. By employing step sizes j3 and /x for the receive filter 
algorithm in this section. First, we have to rewrite the received and the code matrix recursions, respectively, we obtain the 
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C-ARMO SG algorithm derived as 

Wj[i + 1] = Wj[i] + f3(e*[i]r[i]), 

®e qkj [i] + Kcj [i]s*j [i]wj [»]<*£. [»]). 



*eg fcj [t + 1] 
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